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1.  Abstract/Introduction  (SF298  requirement) 

Systemic  lupus  erythematosus  (lupus)  is  a  potentially  deadly  systemic  autoimmune  disease 
that  disproportionately  afflicts  women  and  African-Americans.  This  project  is  designed  to 
discover  genes  that  increase  the  risk  of  lupus  in  African-American.  Our  goal  was  to  expand  the 
genotyping  density  by  genotyping  a  subset  of  our  African-American  lupus  cases  and  controls  on 
the  OMNI-1S  platform  and  then  to  exploit  this  genotyping  with  sequencing  and  follow  up 
genotyping  in  an  effort  to  identify  the  genes  that  alter  disease  risk.  Two  years  ago  we 
abandoned  our  hope  of  genotyping  the  control  samples  from  Detroit  and  instead  found  and 
have  already  genotyped  3000  African-American  controls  on  the  OMNI-Express.  (These 
reagents  were  purchased  by  our  collaborator.)  At  this  point  the  genotyping  is  completed  and  we 
are  working  on  the  quality  control  and  data  analysis.  As  we  work  through  the  remaining  issues 
(population  stratification,  imputation,  and  detailed  exploration  of  positive  findings)  we  are 
hopeful  to  be  composing  our  genome  wide  association  study  manuscript  in  the  months  to  come 
and  then  turn  our  attention  to  the  follow  up  studies  with  sequencing  data  and  follow  up 
genotyping.  The  project  now  appears  to  be  spectacularly  productive  with  a  substantial  number 
of  previously  unknown  genetic  associations  that  are  newly  detected  by  this  project. 

2.  Body 

Since  the  population  history  of  African  ancestry  appears  to  reach  back  in  time  much  further 
to  the  most  recent  small  founder  population  (-200,000  years)  than  the  other  major  human 
ancestries  (<50,0000  years  for  Asian,  European,  or  Amerindian),  the  extent  of  linkage 
disequilibrium  is  much  lower  in  population  samples  of  African  ancestry.  This  means  that  the 
usual  approach  for  finding  genetic  association  using  haplotype  block  tagged  markers  will  be  less 
successful  in  this  ancestry.  One  way  to  partially  compensate  for  this  problem  is  to  increase 
marker  density,  which  is  what  we  are  funded  to  do  in  this  project  in  our  genetic  study  of 
systemic  lupus  erythematosus  (lupus). 

Lupus  in  African-Americans  is  more  severe  and  more  deadly  than  in  other  populations,  and 
especially  so  compared  to  European-Americans.  Indeed,  lupus  afflicts  women  ten  times  more 
frequently  than  men  with  a  strong  tendency  to  strike  during  the  child-bearing  years  and  is 
relatively  common  among  the  Active  Duty  Military. 

3.  Key  Research  Accomplishments 

•  Our  DOD  project  is  a  component  of  a  larger  project  to  more  fully  characterize  African- 
American  genetic  association  with  lupus  by  genome  wide  association.  Genome  wide 
association  genotyping  is  completed.  We  continue  the  intensive  evaluation  of  the  data, 
removing  markers  and  samples  that  appear  to  have  the  potential  of  artifact  from 
stratification,  poor  clustering,  batch  effects,  reduced  marker  calling,  and  disproportions  in 
controls.  At  this  point  we  are  working  with  data  from  all  sources  from  7,300  DNA 
samples.  Of  these  this  project  contributed  574  controls  and  1,590  cases  evaluated  on 
the  OMNI-1S  platform.  These  results  are  complemented  by  genotyping  from  434 
controls  and  2,359  cases  on  the  OMNI-1  platform.  (Together,  the  OMNI-1S  and  OMNI-1 
are  almost  equivalent  to  the  OMNI-2.5.)  We  have  1 ,536  controls  available  with 
genotyping  from  the  OMNI-2.5  platform,  available  from  dbGaP.  In  addition,  we 
genotyped  3,985  African-American  controls  on  the  OMNI-Express  platform  in  a 
collaboration  with  Mt.  Sinai  in  New  York  City.  The  merged  data  provide  nearly  3  million 
single  nucleotide  markers  from  7,300  subjects  for  a  dataset  that  will  have  the  power  for 
inquiry  equivalent  to  nearly  22  billion  genotypes  when  the  imputation  across  the 
platforms  is  completed. 
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•  We  have  discovered  presumed  artifacts  that  originate  from  batch  effects  between 
controls  sets,  which  we  have  eliminated  from  the  data. 

•  The  major  issue  has  been  that  the  imputation  of  genotypes  against  the  1000  genomes 
project  was  technically  difficult  to  perform  and  has  required  a  year  of  diligent  effort  to 
solve.  As  a  consequence  we  have  an  abundance  of  positive  associations  that  do  not 
appear  to  be  robust.  Consequently,  we  submitted  a  request  for  extension  without 
additional  funds  in  order  to  address  the  issues  raised  by  these  results,  which  was 
submitted  to  the  DoD  on  27  August,  2013.  We  are  awaiting  a  decision  on  this  request. 
The  following  four  bullets  are  taken  directly  from  that  request. 

•  The  delays  in  our  plans  came  from  the  genotyping  data  having  been  performed  on  four 
different  array  platforms.  This  made  the  imputation  of  genotyping  very  difficult,  crashing 
our  computer  system  repeatedly  and  requiring  much  sophisticated  informatic 
intervention  and  experimentation.  Indeed,  we  started  the  imputation  in  September  2012 
and  did  not  finish  the  somatic  chromosomes  until  July  2013.  The  X-chromosome  is  now 
being  imputed  and  we  hope  will  be  completed  before  the  end  of  September. 


-  Imu  «  * 


■  a  r  t  i-  b  ii  ir  i>  »  n  h  v  ■  i  s  a  ? 


F  ig  u  re  1,  Th  e  -log  { p)  of  associaion  is  p  lotted  (0  to  340)  ag  ainst  th  e  g  eno  me  position 
[somatic chromosome  number i  for thege retyped  and  imputed  results  before  data 
cl  e  a  nirg  .Ours  uspicion  is  that  m  a  ny  of  th  ese  p  urp  orted  assodati ons  are  s  pu  rio  us  an  d 
tli  at  only  a  subset  are  robust.  The  experiments  proposed  v;  ill!  attempt  to  identify  such 
robust  assotiadons. 

•  We  cannot  order  the  confirmatory  reagents  until  we  are  settled  on  what  results  need  our 
experimental  attention.  At  this  point,  we  are  close  to  knowing  what  results  need  closer 
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experimental  scrutiny,  but  we  are  not  quite  ready,  given  the  still  pending  X  chromosome 
results.  We  have  saved  some  funding  for  this  purpose  and  would  intend  to  do  this  and 
any  sequencing  that  we  might  be  able  to  afford  from  the  remaining  funds. 

•  The  outstanding  issue  has  to  do  with  results  that  are  imputed.  With  a  small  error 
accompanies  imputation,  which  is  compounded  when  there  is  any  issue  with  the 
genotyping  data  upon  which  the  imputation  is  based.  Consequently,  we  approach  the 
imputation  results  with  some  reservation  that  they  are  correct.  Since  we  have  so  many 
apparently  positive  results  we  evaluated  these  data  to  find  the  most  trustworthy  results 
that  we  would  then  intend  to  publish.  Figure  1  presents  the  initial  imputation  results  for 
the  somatic  chromosomes.  We  imposed  additional  data  quality  criteria  on  these  results 
requiring  that  there  be  a  90%  call  rate,  that  Hardy  Weinberg  proportions  have  p>0.001  in 
the  controls,  that  the  samples  done  with  OMNI-1  controls  did  not  differ  from  the  other 
controls  by  p<0.001,  that  there  be  a  genotyped  marker  with  an  association  of  p>0.001 
that  is  within  20  kb  that  is  in  disequilibrium  with  an  r2>0.2.  After  the  loci  that  do  not  pass 
these  criteria  are  removed,  we  are  left  with  the  HLA  region,  which  contains  multiple 
imputed  associations,  and  100  others,  as  presented  in  Figure  2,  that  have  a  predicted 
genome  wide  imputed  association  (p<5x10"8).  These  results  are  extraordinary  and  we 
think  it  important  to  determine  how  many  of  these  are  robust,  which  we  would  test  by 
direct  genotyping  in  a  subset  of  the  subjects. 


Figure  2.  The  -leg  (pj  of  association  [Oto  29)  is  plotted  against  the  genome  position 
(s  o  m  atic  ch  rom  osom  e  n  un  ber)  f  o  r  th  e  g  e  notyp  ed  a  nd  im  puted  results  after  d  ata 
cl!  e  a  ning,  I  eaving  th  e  H  LA.  reg  ion  o  n  Ch  rom  osom  e  5  a  n  d  10 1  oth  er  g  e  no  me  v;  id  e 
5  ig  nff  icant  asaoci ons.  B I  ue  d  ots  a  re  im  puted  g  en  otyp  es  e  ach  the  other  zo  I  ors 
indicate  genotyped  loti  from  one  of  tin  e  three  contributing  arrays. 
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•  The  critical  experiment  that  we  hope  to  perform  with  the  extension  without  additional 
funds  (EWAF)  is  the  actual  genotyping  of  as  many  of  the  imputed  associations  as 
possible  in  as  many  subjects  as  we  can  afford.  Consequently,  we  will  use  our  remaining 
funds  to  perform  this  critical  experiment.  Perhaps,  we  will  be  able  to  establish  the 
validity  of  the  association  of  at  least  some  of  these  loci  (Figure  2)  with  actual 
experimental  data. 

•  Studies  published  this  past  year  have  presented  novel  findings  in  a  variety  of  settings. 
Molinaros  et  al  (1)  shows  how  applying  the  strategy  of  admixture  mapping  can  lead  to 
the  detection  of  genetic  effects,  in  this  case  showing  the  complex  relationship  of  IFIH1  to 
lupus.  Vaughn  et  al  (2)  reviewed  the  current  state  of  lupus  genetics  with  an  emphasis  on 
B  cell  signaling.  Namjou  et  al  (3)  discovered  a  unique  variant  of  the  ClqA  gene,  known 
in  the  null  form  or  when  anti-CIqA  autoantibodies  are  present  to  be  associated  with 
lupus.  Finally,  Kim  et  al  published  data  showing  that  the  ICAM1-OCAM4-ICAM5  locus  is 
associated  with  lupus. 

•  We  look  forward  to  completing  this  project  and  hope  that  we  will  have  a  comparatively 
productive  final  report  next  year. 
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5.  Conclusions 


The  planned  project  to  expand  the  density  and  number  of  markers  in  a  case-control  study 
of  African-American  systemic  lupus  erythematosus  is  proceeding  toward  completion. 
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